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Abstract 

We study two models of population with migration. We assume that we are given 
infinitely many islands with the same number r of resources, each individual consuming 
one unit of resources. On an island lives an individual whose genealogy is given by a 
critical Galton- Watson tree. If all the resources are consumed, any newborn child has to 
migrate to find new resources. In this sense, the migrations are constrained, not random. 
We will consider first a model where resources do not regrow, so the r first born individuals 

CNJ remain on their home island, whereas their children migrate. In the second model, we 

assume that resources regrow, so only r people can live on an island at the same time, the 
supernumerary ones being forced to migrate. In both cases, we are interested in how the 
population spreads on the islands, when the number of initial individuals and available 

^vq resources tend to infinity. This mainly relies on computing asymptotics for critical random 

walks and functionals of the Brownian motion. 
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1 Introduction 

y—i The incentive for this work is the series of three papers by Bertoin (see [4, 2, 3] and references 
therein) considering population models with neutral mutations, the latter occurring randomly. 

y—i Mutations can also be viewed as migrations from an island to another, the individuals living on 
the same island being exactly those with the same alleles. While considering random mutations 
is natural, it is just as legitimate to assume that migrations do not happen randomly but are 
constrained: individuals will migrate when they need to find new resources to survive. 

We shall study two models, which can be loosely described as follows. Individuals live on 
different islands, and each consumes one unit of resources to live. If one is born on an island 
where the resources have run out, they will migrate to a virgin island (that is, where no one 

^ has ever lived) and found their own colony. We shall thus assume that there are infinitely many 
virgin islands, and moreover, that each contains the same quantity of resources, denoted r in 
the following. Our two models differ only by one point: in the first one, the resources do not 
regrow, whereas they do in the second. In other words, in the first model, when r individuals 
have lived on an island, the next ones being born on this same island will migrate. In the 
second model, when r individuals live on an island at the same time, and more individuals are 
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born at that time, the latter have to migrate. Our models are reminiscent of the virgin island 
model of Hutzenthaler [10], though once again, in our case, migrations are not random. 

Our goal is to study how the population spreads on the different islands, that is, compute 
the number of islands where k people live, for HI We do not wish to give a dynamical 
version of this, and we will thus wait until the population is extinct, and compute the number 
Z k of islands where k people have lived. To this end, we shall study the measure 

This is thus probably a good time to be more precise and introduce the quantities we will 
consider. 

For both models, our study shall be fourfold. The first step is to construct precisely the 
object of interest. This will be called the tree of isles, in reference to the tree of alleles of [4], 
and it will encode all the relevant information. This is a multitype tree, and when referring to 
it, we shall adopt an unusual language. A vertex of this tree corresponds to an occupied island 
and we will thus call its vertices "islands". A descendant of an island J is an island founded by 
a migrant coming from J, so we call "colonies" these descendants. Finally, the type of an island 
will be the total number of individuals who lived on it, and we shall thus instead refer to this 
type as its "population". 

The construction of the tree of isles is deterministic from a given finite tree. Consider the 
latter as the genealogical tree of the first existing individual, whom we shall impiously call 
Lucy 1 . Her children may have to migrate to find new resources - obviously, this depends on 
the model. They move each to a different island and found their own colony, from which other 
individuals may migrate, and so on. Then, the root of the tree of isles corresponds to the home 
island of Lucy, its colonies to the islands founded by the migrants from this island, and so on. 
We conclude the construction of the tree of isles by attaching to each island its population. 

The second step of our study is to find a relevant way to compute the population and 
number of colonies of the home island of Lucy, i.e. of the root of the tree of isles. We will be 
able to encode this quantities through an exploration process of the genealogical tree of Lucy, 
and for each model, we shall give an algorithm to construct this process in such a way that this 
information can be easily read on it. For the first model, the construction is simple since it is 
just the usual breadth-first search algorithm. Unfortunately, for the second model, we are led 
to writing quite abstruse definitions, but the figures should be enough to enlighten the reader. 

The third step is now to add some randomness, in taking the genealogical tree of Lucy to 
be a Galton- Watson tree. Let us fix once and for all a reproduction law p, which is critical, i.e. 
has mean 1, and a second moment < a 2 < +oo. In particular, this tree is a.s. finite, and we 
can construct the tree of isles w-wise. It will be clear that the branching property of the initial 
tree carries on to to the tree of isles, which turns out to be a multitype Galton- Watson tree. 
This explains in particular why, in the previous paragraph, we were only interested in encoding 
the population and number of colonies of the root of the tree of isles: once we know this is a 
multitype Galton- Watson tree, these quantities are indeed enough to characterize it. 

This ends what we can do starting from one individual and a fixed number of resources. 
Our last step is to pass to the limit, in different senses which will be made precise later. To this 

^^Which is the name given to the earliest known hominin, before the subsequent discovery of the earlier 
remains of "Ardi". Another reason for this choice is that lately, the literature has grown quite fond of the name 
"Eve cluster", designating thus a group of individuals, not a single one. 
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end, we will start from a number N, meant to go to infinity, of initial individuals spread on N 
different islands. Their genealogies are assumed to be given by i.i.d. Galton- Watson trees with 
reproduction law p. We wish to rescale the number of resources so that, on every island, there 
is a probability of order 1/N that some people migrate. This is similar to the rate of mutation 
considered in [4], or what is classically done in the Wright-Fisher model when considering rare 
mutations. In this case, there shall thus be a Poisson number of these initial islands which will 
have colonies. Clearly, this implies that there should be a number r = of resources tending 
to infinity, but the precise speed actually depends on the model. The main part of the work will 
be to compute the empirical measure describing how the population is spread on the different 
islands, more precisely its limit after a proper rescaling. We will also provide a direct way to 
construct this limiting measure, which enlightens the structure of the tree of isles. 

Finally, for the first model, we will provide an extra result, concerning the limit of the tree 
of isles in terms of trees. In other words, the goal here is to keep track of the genealogy of the 
islands, which is lost in the mere computation of the empirical measure. A similar result could 
be obtained for the second model, but it would just be a technical modification of the first one, 
and would not bring, we think, more understanding of the model. 

The paper is organized in four sections following this introduction. The second is devoted 
to defining the objects we will consider and explain the techniques we shall use. In the third, we 
will recall some definitions about exploration processes, most of it being rephrasing of known 
(but quite sparse in the literature) folklore. The fourth and fifth section then deal each with 
one model. The second model is more involved than the first one and we can thus only advise 
the reader to follow the given order. We however hope to show that the second model has an 
interest on its own, on the one hand because it is probably more natural to consider regrowing 
resources, and on the other hand because it leads to the computation of nice asymptotics for 
random walks and formulas for the Brownian motion. 

Acknowledgments The author would like to thank Jean Bertoin for suggesting to study this 
topic, and for his always accurate insights. Heartily thanks also to Lorenzo Zambotti, Amaury 
Lambert and Olivier Henard for stimulating discussions, as well as to Marc Yor for providing 
several less known results on Bessel processes. 

2 Definitions and techniques 
2.1 Trees 

We shall always consider rooted and ordered trees, thus having natural notions of ancestors, 
descendants, or individuals being on the right or the left of another. We shall adopt the following 
classical formalism. First, define the universal tree 

11= UN" 

ngZ+ 

with N° = {0}, and let II* = U\{0}. If u £ N k , we say that u is at generation k and write 
\u\ = k. The root is 0, and the children of an individual u = (ui, . . . , «&) at generation k are 
uj := (ui, . . . , Uk,j) for j £ N. We call a tree rooted at a subset T of IX such that 

• 0£T; 
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• if v G 7 and v = uj for some u G IX and j G N, then mgU; 

• for every w G T, there exists a number /c u (T) G Z + such that w j G T if and only if 

The quantity fe u (T) is the number of children, or descendants, of u [k stands for "kids"). A tree 
rooted at u G IX is a subset 7 of IX which can be written as u7', where 7' is a tree rooted at 0. 
If no detail is given, a tree will always be assumed to be rooted at 0, though we will just call 
tree a tree rooted at another vertex if the context is clear. A Galton- Watson tree (with types 
or not) can thus be defined in a natural way, see [7, 14]. We let s(7) to be the size of T, that 
is, its cardinality. A labeling of a tree 7 is a bijection between 7 and {1, . . . , s(7)}, and, given 
a labeling, we may refer to "vertex i" instead of "the vertex labeled i". 

We call these trees discrete and will also use, for the second model, continuous trees, that is 
trees whose branches have lengths, interpreted as life-lengths of the individuals: the latter give 
birth to all of their children just before dying. Let us not dwell on an obscure formal definition, 
since once again, figures shall make things lucid. 

A multitype tree A is a tree such that a type A u is attached to each vertex u of the tree. 
The types here will be positive, and we will give type to the vertices which do not belong to 
A, i.e. define A u = for u G U\A. In particular, the set of vertices with non-zero type is a 
tree, and we will thus say that u G A if A u ^ 0. 

2.2 Construction of the tree of isles 

Let us start by recalling that the trees involved in the first model will be discrete, whereas they 
shall be continuous for the second model. Let us fix a tree T, interpreted s the genealogical tree 
of an individual funding an island J. For each model, we will define (respectively in Section 4.1 
and 5.1) the migrant children of a tree T, which is just a subset of vertices of T. We denote 
C(T) their number, which is, in our terminology, the number of colonies of J, though we may 
also refer to it as the number of colonies of T. These individuals are roots of disjoint 2 trees 
which we will denote T^ igr , . . . ,T^^\ After we cut off these subtrees from T, we obtain a 
pruned tree, which describes the genealogy of this individual restricted to its offspring living 
on J. The size of this pruned tree shall be denoted P(T), and is the population of J, or of T. 
We start with the genealogical tree 7 of Lucy. 

This notation obviously depend on the model, but we shall not specify it: clearly, it refers 
to the first model in Section 4 and to the second one in Section 5. However, it depends on the 
number of resources we consider, and we shall add an index r if necessary, writing P r (7) and 
C r (7) to specify that we deal with r resources. 

The tree of isles of 7, A, can then be constructed recursively as follows. Recall that it 
is a multitype tree, whose vertices are called "islands", types "population" and descendants 
"colonies". 

• First, take X = {(0,T)}. 

• At each step, if X = then stop. Otherwise, pick some {v,T) in X, and remove it from 
X. Add island v to A, and give it population P(T) and C(T) colonies (if any). Then, for 
i G {1, . . . , C(T)}, add (vi, ) to X, and go to the next step. 

2 This will be clear from the precise definition of these migrant children. 



4 



It should be clear that this algorithm provides a multitype tree, which is, by definition, the tree 
of isles constructed from 7. 

It is already worth keeping in mind the following properties of the tree of isles, which shall 
be proven later: 

• its construction is deterministic; 

• the information which is relevant to our study is encoded in the tree of isles; 

• if the initial genealogical tree is a Galton- Watson tree, then the tree of isles is a multitype 
Galton- Watson tree. 

Let us finally take advantage of this section to fix some notation. A deterministic tree shall 
be denoted 7, and its tree of isles A (from French "arbre", tree). A random tree (with no type, 
and it will always be a Galton- Watson tree) will be denoted T, and the corresponding tree of 
isles A. Note once again that the latter is just obtained deterministically from T: each T(u) 
gives rise to a A(w). 

2.3 Notation and empirical measure 

Let us first introduce some notation. 

• Ck is the space of nonnegative continuous functions with compact support in (0, +oo); 

• M + is the space of nonnegative Radon measures on (0, +oo), endowed with the vague 
topology (i.e. the space of test functions is Ck)', 

• we denote =>- the convergence in M + , and ^-l the usual convergence in distribution of real 
random variables; 

• for CP G M + and / G Ck, (CP, f) is the integral of / with respect to CP; 

• finally, for notational simplicity, in the whole document, we do not write the integer parts. 

As mentioned in the introduction, we want to start with a large number of Galton- Watson 
trees with reproduction law p, which has mean 1 and a second moment < a 2 < +oo. We 
will thus pick (Q,^, P) a probability space where an i.i.d. family (T fe ) fc > of those trees can 
be defined. We can then construct, for each model, the corresponding trees of isles (A fe (r)) fc > 
corresponding to r resources, which are also i.i.d. since the construction of the tree of isles is 
deterministic. 

Remember that we wish to have the number N of initial islands, as well as the number 
of resources r N on each island, tend to infinity, and to study the empirical measure of the 
population. We thus define, for each r, n G N, 



In other words, the mass of CP„(r) at I G N is the number of islands where / people have lived, 
when starting from n islands and considering r resources. We are interested in the convergence 
of 7n(j'n), but to obtain a nontrivial limit, we will need to rescale it, by setting 



n 



?n(r) = E E <W)1 



{A£(r)^0}- 



k=lu&l 



n 



^ ) W = EE^ W /^ 1 



{A£(r)^0}- 



k=l u&l 
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This rescaling actually implies (see Lemma 4.6) the tightness of (CP^r (r^)) in M + . Clearly, 
CP^^r) is just the sum of n independent copies of the reference measure 
We also define T := T 1 and A(r) := A 1 (r), so (T fc ) (resp. (A fc (r))) is a family of i.i.d. copies of 
T (resp. A(r)). Finally, we let, for p > 1, % > 0, 

Z ljP (r) = #{m e IX, |u| = z, A u (r) = p}, 

so we can rewrite 

p>l i>0 

what will turn out to be very useful to write an equation solved by the cumulant of 7^\r^): 
recall that the cumulant of a random measure CP on M + is 

«(/) = - In E(exp -<?,/» 

for / G C#r, and that the knowledge of for every / e characterizes CP. 

2.4 General technique of proof 

To compute the limit of (CP^(rjv)), we will proceed in several steps. The first is to prove the 
tightness, which is easy. The second is to prove the uniqueness of the limit. To this end, we first 
give an equation solved by the cumulant %(/) of CP^(rjv), which is obtained thanks to the 
branching property of the tree of isles. We then pass to the limit in this equation to obtain an 
equation solved by any subsequential limit of kn{J), which is readily proved to have a unique 
solution. This guarantees, with the tightness, the convergence of (CP^(r/v)). 

The main part of this work is to compute the limit of the unknown quantity involved in 
the aforementioned equation, namely the population and number of colonies of the root of 
the tree of isles. As we mentioned, these quantities can be read on a random walk (which is 
the exploration process of the genealogical tree of Lucy), and our work will then boil down to 
finding limits of functionals of random walks. This is undoubtedly the most technical part of 
the work; however, if we leave aside the necessary computations, most of the results should not 
come as a surprise. 

3 Exploration processes 

In this section, we shall provide a general way to build the exploration process of a tree. The 
goal of this construction is that, if this tree is a Galton- Watson tree, then its exploration process 
is a random walk. These matters are classical, and we shall not dwell on the proofs. 

3.1 A general construction of the exploration process 

Consider a finite tree T, with size s = s(T). There are several ways to label it, i.e. write a 
bijection between the set of vertices and {1, . . . , s}, the most common being by breadth-first 
and depth-first search. From a labeling, we may construct the exploration process as follows: 
denote fcj the number of children of (the individual labeled) i. Then the exploration process is 
given by So = and 

S, = (fci -!) + -•- + {hi -1), z = l...s, 
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and we let Si — — 1 for i > s for the sake of definiteness. We will extend this process to the 
whole of 1R + by interpolating linearly for the first model, whereas we will instead set St = S\tj 
for the second model. When 7 is a Galton- Watson tree with reproduction law p, and when it 
is labeled by breadth-first or depth-first search, it is well-known that (S n ) is a left-continuous 
random walk, with step distribution p, absorbed at —1 at time s, where p{k) = p{k + 1) for 
k > -1. 

Let us now explain a more general way to label T so as to conserve this last property. Let 
T be a discrete or continuous tree. We say that £ is a line in T if £ is a set of vertices such that 
every path from the root to a leaf contains at most one vertex of £. We now define 7i the tree 
pruned at level £, as follows. 

• In the discrete case, 7i is the connected component of the root when we delete the 
descendants of the individuals of £: see Figure 1. 

• In the continuous case, 7g is the connected component of the root when we delete the 
descendants of the individuals of £, but keep the life-lengths of all the individuals in £: 
see Figure 2. 




Figure 1: A discrete tree 7, a line I (the circled vertices), and the tree 7^ pruned along this line. 

We will label our tree T by giving labels 1 to s{7) in this order, the label 1 going to the 
root. To explain which vertex we label i + 1 after we have given labels 1 to i, we use what we 
call a Markovian rule. 

Definition 3.1. Let L be the set of nonempty lines in T. A Markovian rule 31 is a mapping 
from £ to the set of vertices of 7 such that, for every £ e £, e £ and H(£) depends only 
on 7 i. 

The labeling of 7 corresponding to this rule is then given by the following algorithm. 

1. At step 0, take £ to be the root of 7, and go to step 1. 

2. At step i: 

• if £ = 0, then stop; 

• if not, choose the vertex v according to the rule "R, i.e. take v = Jl(£). Give v the 
label i. Then remove v from £, and add to £ the children of v; 
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Figure 2: ^4 continuous tree 7, a line £ (the circled vertices), and the tree 7i pruned along this 
line. Note that we cut the descendants of the individuals in £, not the individuals in £, so the 
right-most individuals still remain in 7^. 



• then go to step i + 1. 

It should be clear that when the algorithm stops, the tree is labeled. Let (S n ) to be the 
exploration process corresponding to this labeling. We claim the following. 

Lemma 3.2. • The size o~(7) of the tree is the hitting time of —1 by (S n ). 

• If 7 = T is a (discrete or continuous) Galton- Watson tree with reproduction law p, then, 
at each step of the labeling algorithm, conditionally on i, the subtrees rooted at the vertices 
of £ are i.i.d. with the same law as T. In particular, (S n ) has the law of a random walk 
with step distribution p, absorbed when first hitting — 1. 

Proof. The first part is classical and easily proven by induction. The second part is just the 
observation that, at each step of the labeling algorithm, the fact that we choose a Markovian 
rule ensures that the set of edges above £ is a stopping line, in the terminology of [6] (see also 
the nice informal explanation in [1]). This ensures that the branching property holds for the 
subtrees rooted at £, i.e. that conditionally on £, the subtrees rooted at the vertices of I are i.i.d. 
with the same law as T. This clearly implies the statement about the exploration process. □ 

This result obviously encompasses the two known cases mentioned. The rule in the breadth- 
first search case is "pick the leftmost vertex at the lowest generation in t\ and "pick the smallest 
vertex in the lexicographical order in £" in the depth-first search case. We may make up a lot of 
valid rules, and this construction will be mostly useful when we study the second model, since 
the rule then is quite involved. 

3.2 Death- first search algorithm 

Let us now introduce a particular labeling of a continuous tree by what we shall call "death-first 
search". Consider a continuous tree T, such that two events (birth or death) do not occur at 



S 



the same time. Our rule is, for a line I, to pick the individual in £ who dies first. Since keeps 
track of the life-lengths of the individuals in £, this is clearly a Markovian rule. See Figure 3 
for an example, and Figure 4 for the corresponding exploration process (S n ). 
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Figure 3: Labeling of a tree by death- first search. 
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Figure 4: Exploration process corresponding to the labeling of the tree in Figure 3. 

Now, let = t < 7~i < • • • < r k < r k+ i = +oo be the times of the successive events of birth 
or death. Then 1 = 1 + So is the number of individuals living on [t ,ti). Then the number 
of children of 1 is 1 + Si, so the number of people alive on [t\,T2) is 1 + Si. The number of 
children of 2 is 1 + (£2 — Si), so the number of individuals alive on [r 2 , r 3 ) is 

death of 2 

1 + Si + ^1 + 1 + S 2 -Si =1 + 52. 

v v ' v v ' 

individuals alive at r" number of children of 2 

By a clear induction, one readily sees that 1 + Si is the number of individuals alive on [n, Tj+i), 
for i G {0, . . . , k}. We will use a variation of this algorithm and this observation when studying 
our second model. 
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4 First model: fossil resources 



Let us recall informally our first model, which involves fossil resources, i.e. non-regrowing 
resources. Each island J contains a number r 6 Nof resources, which are consumed by the first 
r individuals living on J. Every new individual born on J after these r first will migrate, each 
to a new virgin island with r resources, to found its own colony. 

4.1 Tree of isles 

Let us start by defining the tree of isles A{r) from a discrete tree T. According to Section 2.2, 
to this end, all we need to do is to explain how we choose its migrant children, which should 
be clear from the informal description of the model. Since several children may be born at the 
same time, the only issue is to choose which children migrate after the resources on the island 
have been exhausted, and we will pick arbitrarily the right-most children. 

To make this precise, if s(7) < r, then 7 has no migrant children. Else, label 7 by breadth- 
first search. Then the migrant children are the individuals with label in {r + 1, r + 2, . . . }, which 
are descendants of individuals with label in {1, . . . , r}. See Figures 5 and 6 for examples. 

13 14 15 




Figure 5: A tree labeled 7 by breadth-first search. Its migrants for r = 5 are shown in a square, 
the individuals in a circle remaining on the island. Here P r (7) = 5 and C r (7) = 4. 

Let us go on with our plan: we wish to read the population and number of colonies of the 
root of A(r) on the exploration process of 7. To this end, we shall naturally consider (S n ) 
the exploration process associated to the labeling by breadth-first search, see Figure 7 for an 
example. We let 

?oo = inf {t > 0, Si = -1}, inf := +oo, 
the hitting time of —1 by (S n ). We now claim the following. 
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Figure 6: The tree of isles A{r) obtained from the tree 7 of Figure 5, for r = 5. 
Lemma 4.1. The following equalities hold: 

Pr(7) = <Joo A T , C r {7) = 1 + S^ Ar . 

Proof. Recall from Lemma 3.2 that = s(T). Hence, when there are no migrants, i.e. s(7) = 
Soo < r, then P r (7) = s(7) = ^ = ^Ar and C r (7) = = 1 + ^ = 1 + 5 SooAr . 

When there are migrants, i.e. s(7) = ^ > r, then by definition P r (T) = r = ? M Ar. Now, 
by an easy induction as in Section 3.2, it is easy to check that for each i G {1, . . . , s(T)}, 1 + Si 
is precisely the number of children of the individuals with label in {1, . . . ,i} whose label is in 
{i + 1, i + 2, . . . }. Hence, by definition, C r {7) = 1 + SV = 1 + S^Ar- □ 




Figure 7: The exploration process obtained from the tree 7 of Figure 5. Check that, for r = 5, 
?oo A r = 5 = P r {7) and 1 + S r = 4 = C r {7). 

Let us use the notation of Section 2.3, so T is a Galton- Watson tree with reproduction 
law p and A(r) its tree of isles when we consider r resources. Instead of considering the walk 
(S n ) as being constructed from T, we rather take (S n ) to be an actual random walk with step 
distribution p, defined on all of Z + . We thus just have to replace the equalities in Lemma 4.1 by 
equalities in law. Our objective now is to prove that A(r) is a multitype Galton- Watson tree. 
Taking this for granted, the only information we need to characterize it is the law of the type 
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(i.e. population in our terminology) and number of children (or colonies) of the root, i.e. the 
law of the couple (P r (T), C r (T)). We let 7r r the law of P r (T), and 7 r the law of C r (T) knowing 
that P r (T) = r. By definition, C r (T) = when P r (T) < r, so the law of (P r (T), C r (T)) is 
indeed specified by 7r, r and j r . 

Lemma 4.2. T/ie tree of isles A(r) z's a r-type Galton-Watson tree, described as follows. 

• The population of the root is chosen according to 7i r ; 

• the number of colonies of the islands of population r has law 7 r; and each colony chooses 
independently its population according to n r ; 

• the islands of population 1, . . . , r — 1 do not have any colonies. 

Proof. Let us take a look at the algorithm defining the tree of isles A(r). The root has, 
by definition, population given by ir r . If A$(r) < r, then T has no migrant children and 
A„(r) = for u G IX*. Else, &%(r) = r, and T has a number of migrant children given by 
j r . Conditioned on these migrant children, the second statement of Lemma 3.2 tells that the 
subtrees T^ igr , . . . , T^j/ are independent with the same law as T. A simple induction then 
yields the result. □ 

A straightforward corollary of this result concerns (Z ip (r), p = 1, . . . , r)j> , as defined in 
Section 2.3. 

Lemma 4.3. The process (Z i>p (r), p = 1, . . . , r)j> is a r-type Galton- Watson process, described 
as follows. 

• The r-tuple (Z 01 (r), . . . , Z r {r)) has the same law as (l{p r (T)=i} ; ■ ■ , l{p r (T)=r})/ 

• the number of children of the individuals of type r has law 7 r , and each child chooses 
independently its type according to 7r r ; 

• the individuals of type 1, . . . , r — 1 do not have any children. 



4.2 Rescaling 

Let us keep on with our program. We now start from independent islands and r^ resources, 
both meant to tend to infinity. We wish to rescale r^ so that each island has a probability of 
order 1/N to have colonies. From Lemma 4.1, 

P(C rjV (T) > 0) = P(^ > r N ) ~ \ — 2 ^- (1) 



The last equivalent stems from well-known facts about hitting times for random walks, see e.g. 
[21], p. 382. We shall thus assume that, for some c > 0, 

lim ^ = c > 0. (2) 
Let us introduce some notation. We denote for simplicity 

V ira z c 
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Let 

P N = P rN (T) , C N = C rN (T) , p N = F(P N = r N ), 
their conditioned laws 

ttW = L(P N \P N < r N ), 7 (7V) = £(Cn\Pn = r N ), 
and their rescaled versions 

5f<">=£(^|p„< rj v), 7 C) = £ (^|p K = ri ,). 

Let us first study the limit, in different senses, of the two latter. We define here 

and 

9 = L(y/caWf), P(W 1 + G dx) = xe-^l^x,} dx. 

The notation Wf 1 " just stems from the usual notation (Wt + )t6[o,i] for the standard Brownian 
meander, and Wi is its tip, whose law is the Rayleigh law xe~ x2 ^ 2 t^ x>0 y dx. 

Lemma 4.4. As N — > +00, 

A 

and moreover, the convergences 
hold, respectively in M + and in law. 

Proof. The first statement is just (1) written with a non-strict inequality. For the first conver- 
gence, note that more generally, (1) and (2) ensure that, for every a > 0, 

\ \ r+00 1 



fa 2 J a x 3 / 2 
so one readily sees by standard approximations that 

MM ~ G dxW X -L dx. 



,iV 2 / 2x 3 / 2 

Recall also from Lemma 4.1 that P/v has the same law as ^ A rjv, and thus, for / G C^, 

+00 

/(x)iV^(dx) 

/•(rjv-l)/^ 2 

P(Pjv < r^v) -1 / f(x)NF(P N /N 2 e dx) 

Jo 

r(r N -l)/N 2 

(I-Pn)- 1 / /(x)AfP(^/iV 2 Gd3;) 
Jo 

(1 - PnY 1 / /(x)iVP( ?00 /iV 2 e dx) + / /(x)7VP( ?oc /iV 2 e dx) 



-> / /(x)//(dx) 
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using the computation above and that the second term is easily seen to tend to since r^/N — > 
c. 

For the second convergence, note that Lemma 4.1 implies that 7W is the law of 



l + S rN 

N 



Coo > r N . 



It is well-known (see [5]) that a centered random walk with a second moment, conditioned to 
stay positive, converges to the Brownian meander (W t + ) te [o,i], and in particular 



>~>r N 



^>r N { 4 W+, 

whence the result follows after noticing (see e.g. [11]) that Wx has the Rayleigh law. □ 
4.3 Heuristics and result 

Before stating the result, we will, with the help of the previous results, discuss some heuristics. 
Consider the forest 

(K\r N ),...,K N (r N ))/N\ 

where dividing by N 2 means rescaling the population of every island by 1/N 2 . The islands 
with (rescaled) population 1/iV 2 , . . . , (r^ — l)/N 2 do not have colonies, whereas those with 
population r^/N 2 « c may 3 . We shall thus call the latter type of islands fertile. According to 
Lemma 4.4, the number of colonies of a fertile island is approximately f N, where f has law 9. 
We will then say that this island has fertility f . 

Now, by Lemma 4.2, the populations t\, . . . , tfjy of these f N islands are chosen independently. 
Each has a probability ~ A/iV to be fertile, and the population of any other island has law 
7T7v, and N-kn =>■ Xfi c . Hence, the measure 

m 

1=1 

is approximately a Poisson measure with intensity 

f A (// + 8 C ) . 

By the superposition property of Poisson measures, this can be reformulated by saying that the 
descendants of a fertile individual with fertility f are 

• either fertile, and there is approximately a Poissonian number with parameter f A of those; 

• or not fertile, and those contribute to the empirical measure as approximately a Poisson 
random measure with intensity f \fi c . 



3 And probably do: an island of population r^/N 2 does not have colonies only in the (rare) case where the 
founder of this island has precisely rjv — 1 descendants. 
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Finally, since the initial number of individuals is N, we may link them to a virtual ancestor of 
fertility 1. 

This invites us to introduce the following measure rj. We first construct a Galton- Watson 
tree T with fertilities 1 in (0, +00). The number of children of an individual with fertility f is 
Poisson with parameter fA, and each child has a fertility chosen independently according to 9. 
We start from an individual with fertility 1, and denote f u the fertility of u G T. 

Obviously, if we forget about the types, T has the same law as a Galton- Watson tree, 
constructed as follows: 

• the reproduction law of the ancestor is Poisson with parameter A; 

• the other individuals have reproduction law B, which is a Cox law: it is a mixture of 
Poisson laws, where the random parameter is chosen as X^/caWi, where we recall that 
\J~coWx has law 6. 

The point of this alternative construction is to note that X^JcaWi = ^j2/nWi~, so 7^ 5\ and 
has mean 1, so that T is a.s. finite. For matters concerning the construction of these variables 
and the measurability of the functions and variables we consider, we refer to [8, ch. Ill] and [1, 
ch. VI]. 

Finally, we define a measure 77 as follows: consider (z/ u (f), f > 0) ue u an i-i-d. family, consist- 
ing of collections indexed by 1R + of random variables, such that, for each and f > 0, v u {f) 
is a Poisson measure with intensity fA//. This can for instance be constructed using a family 
of i.i.d. Poisson processes indexed by II, see Section 4.5.2. Then we define 

Subtracting 6 C is just to take into account that the ancestor is a virtual one. We may now state 
our result. 

Theorem 4.5. Under the assumption (2), the sequence (O 3 ^ (Vjv)) converges in distribution in 
M + to a random measure with the same law as r\. 



4.4 Proof 

We first fix a sequence [tn] such that (2) is verified and proceed as explained in Section 2.4 by 
first proving the tightness. 

Lemma 4.6. The sequence (^jp(rtf)) is tight in M+. 

Proof. This fact can be proven as in Lemma 5 in [3]. Indeed, one can readily check that, 
since our test functions have compact support in (0,+oo), it follows from the tightness of 
(rjv), Id). But the latter is the total population of a Galton- Watson forest started from N 
ancestors, renormalized by 1/N 2 , and it is well-known that this converges to the total population 
of a Feller diffusion, see e.g. [15]. □ 

Our next step is to derive here an equation solved by the cumulant of T^(rAr), given by 

n N (f) = -lnE(exp-(T^ ) (r iV ),/)) = -AMnE (exp -(7 {N \r N ), /)) , 

where we recall that 7^\r N ) is the sum of iV independent copies of 7^ N \r N ). 

4 This is, once again, just another way to speak of types, but we would rather avoid the confusion. 
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Lemma 4.7. The cumulant solves the following equation 



exp-M/) = E (exp- (/ (^) + (3) 



/or even/ / G C#. 



Proof. Lemma 4.2 and the branching property show that (-^o,p( r Af))p=i,...,rjv has the same law 
as (l{p JV= i}, . . . , l{p JV =r Ar })) an d that, conditionally on py an d Cjv, (^i,-( r 7v))i>i is independent 
from Z 0> , and has the same law as the sum of independent copies of {Z^.{r^))i>Q. Hence, 
conditioning on P/v and Cat, we may write 



exp 



-n N (f)=E(exp-(?^\r N ),f)) 



N 

2\ 



E\exp-J2J2^ArN)f(p/N 

i>0p=l 



( / \Cn\ N 

E 



exp - £ Z 0:P (r N )f(p/N 2 ) x E exp - £ £ Z ijP {r N )f(p/N 2 

p=l \ i>0 p=l 

. N 



E ( exp - £ l {PN=p} f(p/N 2 ) x E(exp -^'(r,)) 

C N /N^ A 



= E (exp -f\P N /N 2 ) x E(exp -^(r^)) 
= E (exp -f(P N /N 2 ) x (exp -K N (f))°»' N ) N 

and the result follows. □ 

This lemma allows us to give an equation solved by any limit point of (kjv(/)) (which exist 
by Lemma 4.6). 

Proposition 4.8. For f G Ck, any limit point n(f) of (k/v(/)) zs the unique solution to the 
following equation 

(r+oo r+oo . . \ 

J (1 - e" /(x) )/i c (dx) + J (l - e - /(c) - K(/):c ) 0(dx)J . (4) 

Note that the first term on the right-hand side is the cumulant of a Poisson random measure 
with intensity /i c . 

Proof. We assume for simplicity that (kjv(/)) converges to some Let us investigate the 

behavior of the right-hand term of Equation (3). The expectation therein tends to 1, so we just 
have to study 

NE (l - e -/(^ 2 W(/)cViV) =NE ^ _ e -f(P N /N^ i {Pjv<rjy} ) 

+ NE ((l - e -/(^ 2 W(/)^/") l {Pjv=r]v} ) . 
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1. We may rewrite the first term on the RHS as 

F(P N < r N )NE ((l - e-K p "^)\t {PN<rN} ) = (1 - p N ) f °° (l - e"/W) N^ N \dx). 
Since a; i — > 1 — e~^ x > G Ck, Lemma 4.4 ensures that this tends to 

y (i- e - /(a:) )// c (dx). 

2. Now, the second term on the RHS is 

NE ((l - e -f(rN/N^ KN (f)C N /N^ 1{pjv=riv} ) 

= iV Piv y +00 (l - e-/^ 2 )-^/)*) ^W(dx) 

/•+oo 

:=iV m / ^(x)7 W (dx). 
The sequence (g^) converges pointwise to 

g ; ^ 1 _ e -/(<0-«(/)* 

and all the ^at's are bounded and uniformly Lipschitz-continuous. Since 7^ — > 6 by Lemma 
4.4, it is then straightforward that 

+oo r+oo 

g N (x)^ N \dx) -> / <7(x)0(dx). 



The result then follows after from the convergence Np^ — > A. 

3. It only remains to prove that the equation has a unique solution. Taking x = e~ K( ^ has the 
unknown, we may rewrite it 

x = A-BE(x a ^ cW i + ) } 
with B > 0. The function xH'i-ii i?E(x f7v/ ^ w/l+ ) has derivative 

1 + 5a v ^E(W/+x CT ^i + - 1 ). 

This last quantity is positive, whence the result follows. □ 

The last step is to check that the measure t? defined in Section 4.3 has the same law as the 
limit measure we just obtained, and to this end, one only need to prove the following. 

Proposition 4.9. The cumulant of the random measure rj solves Equation (4). 

Proof. We denote T(f) a tree constructed as in Section 4.3, but starting instead from an ancestor 
of fertility f, and let 77(f) be the corresponding measure. For g e Ck, consider 

0(f) =E(exp-<T7 + (5 c ,£)). 

Note that we count S c for the virtual ancestor to simplify the computations (in particular, the 

cumulant of 77 = 7/(1) is — In 0(1) —g{c)). Indeed, the branching property shows that in the tree 
T(f), conditionally on the number k of individuals at the first generation, which is Poissonian 
with parameter Af , each subtree rooted at the first generation accounts for a measure rf such 
that 
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• the rf's are independent from the measure ^(/o) + S c generated by the root, 

• rf has the same law as 77 (fj) + S c , where the fj's are i.i.d. with law 9. 

Hence, conditioning on the number of children at the first generation and using the exponential 
formula for Poisson measures, we have 

0(f) = E(exp -(770(f) + S c , g)) £ e~ Af i-jL / E(exp -(r](s) + 5 C , g))9(ds) 

= e- Af e-^E(exp-(r /0 (f),^))5: ^ (/%( a )0(d a ))* 

fc>o fc! ' 

= e - Af e- 9(c) exp -f A j (1 - e~ 9{s) )fx c (ds) exp ( Af jf 0(s)#(ds 
= exp - (g(c) + f A ^ + °°(1 - e^VW + Af (l - J^°° <f){s)6(ds 

This shows that f 

0(f)e 9(c) = (0(l)e 9(c) ) . 

Plugging this in the above formula readily shows that $ := — In 0(1) solves 

$ = g( c ) + \J c(l - e" s(s) )/i(ds) + \(l - J e ~ 9{c) - s{ *- 9{c)) 9(d,< 
so that $ — (?(c), which is the cumulant of i], also solves (4). □ 

4.5 Genealogy of the islands 
4.5.1 Introduction 

The arguments and heuristics mentioned in Section 4.3 should make it clear that a result 
concerning the genealogy of the island could be obtained, which is lost when considering merely 
the empirical measure 

We shall thus now give an idea of what the "genealogical tree" 
of the islands looks like at the limit when iV — > +00. Obviously, this tree is infinite, since e.g. 
we start from iV — > +00 islands. On the other hand, the heuristics of Section 4.3 suggest that 
the tree consisting only of fertile islands should be finite, more precisely be a Galton- Watson 
tree with the (critical) Cox reproduction law O, so that the whole genealogical tree of the 
islands has a.s. finite height (but infinite width). Inspired by Definition 1 in [4], we will now 
introduce the definition of a tree-indexed Continuous State Branching Process with types 5 . 

Let us define in the following the subset of M + of measures which integrate 1 at infinity. 
In particular, if fi G , we can rank the atoms of a Poisson measure with intensity /i in the 
decreasing order, which allows the following definition to make sense. If [i is finite, there is only 
a finite number of such atoms, and we shall always complete this decreasing sequence with an 
infinite sequence of zeros. 

Definition 4.10. Consider a measurable space T, and a family of cx-finite measures (pt)t^T on 
T x (0, +00), such that, for every t G T, p t (T x •) G . Fix t G T and fo > 0. A tree-indexed 
CSBP with types, with reproduction laws (p t )teT, started from (to,fo), is a process (Z u ) u£ u 
indexed by the universal tree U, with values in T x (0, +00), such that 



D The types being what we called populations, but we wish to give the most general definition here. 
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• 2,0 = (t , fo) a-s.; 

• for every k G Z + , conditionally on (Z u ,u G IX, \u\ < k), 

- the sequences (Z U j)jg^, for |w| — /c, are independent; 

— for \u\ = k, writing Z u = (t, f), the sequence is distributed as the family 
of the atoms of a Poisson measure with intensity fp t , where atoms are repeated 
according to their multiplicity and ranked in the decreasing order of their second 
coordinate. 



Notice that the branching property holds with respect to the second variable, that is, the 
independent sum of a CSBP with parameters (to^fo; (Pt)) an d one with parameters (t ,¥ , (p t )) 
is a CSBP with parameters (to, fo + f , (pt))- 




Figure 8: The tree of isles from Figure 10, and its reordering. 



Let us define the object we shall study. As before, we now call the types "population", the 
vertices "islands" and the descendants "colonies". We have a forest of i.i.d. trees of isles 

(A\r N ),...,A N (r N )). 

We root these trees at 1, . . . , N, and link them to 0, to which we give population r^. We call 
the tree obtained and once again, we give population to the islands in UAF^. We 
define the tree F^^ by reordering the colonies of each island, along with their subtree, in the 
decreasing order of their population, leaving unchanged the initial order if ties occur: see Figure 
8. The following result deals with its convergence. We may actually prove a slightly stronger 
result and to this end, let us first introduce the 1i-indexed processes Z^ and Z^'^ defined by 

Z W = ( F f)/iV 2 , k u (¥^)/N) , Z&"W = (F^'V^ 2 , k u (¥^)/N) , ueU, 

where we recall that k u (¥^) is the number of colonies of u in F^, or in formulas, 

k u (^ N) ) = #{j€N, FgVo}. 
We shall prove the following result, where we denote qi the projection on the first coordinate. 
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Theorem 4.11. The process (Z^ N ''^) converges as n — » +00, in the sense of finite dimensional 
distributions, to Z, where Z is a tree-indexed CSBP with types, started from (c, 1), with re- 
production law X(fi c + 5 C ) ® 6 for the type c, and 5o <S> So for the others. In particular (F^^) 
converges to q\(Z). 

It is worth noticing two features of this result. First, one cannot construct the limit of 
(F^^) directly. To this end, we need first to construct Z, and then project it on the first 
coordinate; but the knowledge of the second coordinate is necessary to get the whole process. 
This is why we prove the more general convergence of the two-coordinate process in order to 
obtain the convergence of (F^^). 

Note also that this result agrees with Theorem 4.5, in that the measure generated by the 
tree Z (except for his root) 

V' ■= J2 ( W„) 1 {9i(W0} 
uEU* 

has the same law as the measures of Theorem 4.5. Indeed, similar calculations as in the proof 
below and the proof of Proposition 4.9 can be carried out to show that the cumulant of rj 
solves Equation (4). However, since we only show a result dealing with the convergence of 
finite-dimensional marginals, we cannot deduce Theorem 4.5 from Theorem 4.11. Doing this 
would require to introduce a relevant topology on the tree-indexed CSBP (with types or not) 
and prove the tightness results associated. This technical and long detour would not bring, we 
believe, much more understanding of the model. 

4.5.2 Some results about Poisson random measures 

Let us start with three preliminary lemmas. The first is essentially a classical fact (see e.g. [4]), 
and merely rephrases Theorem 16.18 in [12]. 

Lemma 4.12. Let (u^) be a sequence of probability measures (0, +oo), and assume that 

as N — > +00, for some v G M+ . Let, for each N, )i>o a sequence of i.i.d. random 

variables with law v^ N \ and take, for f > 0, (a^(f), . . . , a^(f)) the reordering of {Y% )i=i...m 
in the decreasing order. Then, for every fixed k > 1, 

(aS 7V) (f),...,ai JV) (f))^(ai(f),...,a fe (f)), 

where (aj(f))j>o is the reordering in the decreasing order of the atoms of a Poisson random 
measure with intensity iv. 

Consider in the following v e , and let (aj(f))j>i as in the previous statement. To make 
sense of the coming results, and since it is of use in the proofs, let us recall how we can construct 
a Poisson measure with intensity iv in a measurable way. Let (Ai)i>i be a partition of IR + in 
measurable subsets of finite i/-measure. Define for i > 1 such that v(Ai) > 0, 

\ t = u(Ai), Ui = u(-nA t )/\ t . 

Independently for each i, let (iV 4 (f))f> be a Poisson process with intensity A, and (XJ)j>i a 
sequence of i.i.d. random variables with law 2^, and define 

N i ({) 

j=l i>l 
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Then £(f) is a Poisson random measure with intensity fz/. 

Lemma 4.13. For every continuous g: IR + — > R with compact support and every r£N, the 
mapping 

f ^ E(s(a r (f))) 

is continuous on M + . 

Proof. Let e > 0. The measure is in , so a := ^((e, +00)) is finite. Then a r (f) > e if and 
only if there are r or more atoms of the Poisson measure in (e, +00), so 



(a r (f) G (£,+00)) = 5> 



-fa 



(fa) fc 



fc! 

which is clearly continuous in f. Then, the complimentary probability P(a r (f) G [0, e)) is also 
continuous. The result thus holds for any indicator function, and the result follows by standard 
approximations. □ 

Lemma 4.14. In the notation of Lemma for every continuous g: 1R + — > R with bounded 
variation and compact support, and every r GM, 

E( 9 ( a W(f)))^E( 5 (a r (f))) 

uniformly on the compact sets of~R + . 

This is in particular true, and that is all we will use, for a Lipschitz-continuous g with compact 
support. 

Proof. The function g has bounded variation, so it may be written as g = g + — g~, where g + 
and g~ are nondecreasing and continuous. Take A > and consider the mappings 

0±:f^E(^(aW(f))) 

on [0, A). It is obvious, in the construction above, that f h-» a^(f) is nondecreasing, thus so 
do 0*. Lemma 4.12 ensures that a^(f) — > a r (f), so by dominated convergence, 0^ converges 
simply to ± , where 

± (f)=E(^(a r ,(f))). 

Now, Lemma 4.13 shows that <f> are continuous, so the result follows from Dini's second 
theorem. □ 



4.5.3 Proof of Theorem 4.11 

The proof will rely heavily on Lemmas 4.2 and 4.4, which we will use without further notice. 

1. Let us start with some preliminary definitions. Recall that we say that u G F^ if F^ 7^ 0. 
For u G F^, F^) is called its population, instead of type, and k u (W( N >)/N its fertility. An 
island in of population r^ N ' := r^/N 2 is said to be fertile. For a function Z indexed by IX 
and u G U, Z u+ is the function obtained by shifting the subtree rooted at u back to 0, so for 
instance, Z* + = Z u . 
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Let be the set of functions from U to (M + ) 2 . For k G Z + , we define Ek the set of functions g 
from J to IR + which can be written as 

g{7)=gu{7 <l) )g 1 {7 1 )...g s {7 s )g ll {7 ll )...g s ... s {7s...s) (5) 

for every T G 5", for some s6N, where the g u 's are Lipschitz-continuous with compact support 
in (R + ) 2 , and the last index consists of k letters s. In particular, such a g depends only on the 
k first generations. If T consists only on a root with population p, that is if T0 = (p, 0) and 
7 U = (0, 0) for u G U*, then we denote g(p) := g(7) = g$(p, 0). 

2. In the sequel, we will need to consider F^(f) a tree constructed as F^, but with fN 
islands at generation 1 (so F^ has the law of F^(l)). We say that this tree has fertility f. 
More generally, we may take f to be random with some law u, and we write FW(i/) for the 
corresponding tree. We may also construct as above F^'^z/), Z( N >(is) and Z^'^^v). Rather 
than E,(g(Z^ , ^(i'))), we may write E u (g(Z^)). We need these trees because of the following 
observation: consider a fertile island u G F^(i/), u 7^ 0. Then u has a fertility chosen according 
to 7^, so the branching property shows that F^ ,u+ has the law of F^(7^), and Z^ N ' ,U+ 
has the law of (7 W). 

Now, take an island u G ¥^ N \u), and assume it is fertile with fertility f. Amongst ul, . . . ,ufn, 
each has independently a probability p^ to be fertile, so the number of fertile islands is binomial 
with parameters fN and pn- The fertile trees have the law of W^^^) , and the other ones 
have the law of the trivial tree with only a root and population chosen according to 7?^. Let 
us condition on u having k fertile colonies, and consider the populations Y} N \f) } . . . , Y f ^}_ k (f) 

of the non- fertile children, (^(f), . . . , Y^l k (f)) is then an i.i.d. sequence with law 7P 

Let us reorder the tree and take a look at the first s colonies of the root of F^'-^z/). Take 

a binomial variable with parameters fN and p^. For 1 < % < s, the analysis above shows that 

with probability ¥(Xf N ' ) A s = i), the following happens: 

• the % first islands are fertile; 

• the population of the s — i following islands has the law of (a^^f), . . . , a^](f)) where we 
denote (a^^f), . . . , a|^^(f)) the reordering in decreasing order of a (f N — i)-sample of law 

3. Let us now reason by induction, the assumption 7 T at rank r being that for every / G 
and every laws u^ N \ v on IR + such that ^ u, we have 

E u(N) (f(Z^))^E u (f(Z)), (6) 

where Z is defined in Theorem 4.11. This obviously implies the result by picking 

To is easy to check by weak convergence of to v and uniform continuity of f$. So assume 
CP r _i, and take g G E r , which we write 

g(Z)=g(Z )g l (Z 1+ )...g s (Z s+ ), 

with gi,...,g s e E r ^. 
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By the analysis above, we may condition the tree ¥^ N \u^) on the fertility of the root and its 
number of fertile children, to obtain 



E u(N) (g(Z (N) )) = /^(d%0(rW f)x 

(£P(X f (A ° = 8 )E ?) ( 9l (^))..J ?sl fe(^)) x 

Vi=0 

E(^ +1 (ai Af) (f)))...E(^ s (aK(f))) 
+ £P(X f (JV) = i)E~ (N) ( 9l (Z^)) ■ ■ .E~ m (g s (ZW)) 



i>s 
s-1 



E% w ( 9l (#'))...E~ m ( 5i (^)) : 



i=0 



J u^(di)g d ^ N \i)nxl N) = i)M(g i+1 (4 N \f))) . . .E^a^f))) 
+ /^ (rW,f)P(X f W >z)^(df). 



By the induction hypothesis CP r _i and Lemma 4.4, 

%w(ft(Z (JV) ))-^%(Z))- 
Define Xf a variable with Poisson law with parameter Af . Let us prove that the quantity 

J z,W(df)^ (rW,f)P(X f (iV) = i)¥{g i+1 {^\i))) . ..E(gM N lm (7) 



converges to 



z/(df)^(c,f)P(X f = i)E(</ i+1 ( ai (f))) . . .EQ^a^f))) 



(8) 



where (aj(f))j>i is the reordering in the decreasing order of the atoms of a Poisson measure 
with intensity f/i c . According to Lemmas 4.4 and 4.14, we obtain that 

E(g i+j (af } (f))) E(^.( aj (f))), j = 1 . . . s - i, 

uniformly on the compact sets of M + . Using this fact and the weak convergence of v^ N ' to u, it 
is easy to see that (7) has the same limit, if any, as 



J v( N \df)g $ (A N \f)F(X^ = z)E(<7 i+1 ( ai (f))) . . .E^a^f))). 
The difference between (9) and (8) is bounded, up to a constant, by 

/ fl (rW,f) - g $ (c, f) i/W(df) + sup \gt\ I i/W(df) P(X f (A ° = <) - P(X f = <) 



(9) 



(c,f)^)(df)-|^(c,f)z/(df) 



The first and last term tend to by uniform continuity of g$ and weak convergence of to 
v. For the second one, Le Cam's inequality [ ] gives 



P(X f 



(TV) 



P(X f = i) 
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where O is uniform in f, what ensures the expected convergence. By the same reasoning, we 
also have that 

J £ (rW f)P(X f (A ° > i)i/™(df) J g 9 (c,f)F(X f > i)i/(df). 
We hence get that E u{N) (g(Z {N) )) tends to 

J2M9i(Z))...E g ( gi (Z)) x J ^(c,f) I /(df)E(p i+1 (a 1 (f)))...E(p s (a fl _ i (f)))P(X f = i) 

+ Eg( 9l (Z)) . . . E e (g s (Z)) J g $ (c, f)P(X f > i)u(d£). 

By a similar computation as just done, we can see that this is precisely E u (g(Z)), what shows 
the result. 

5 Second model: regrowing resources 

We shall now present our second model of migration under constraints, which is arguably more 
natural than the first one. The only difference is the migration rule: now, we shall assume 
that individuals will migrate when there are too many of them living at the same time on the 
same island. In other words, we assume that each island has regrowing resources, enough to 
feed r people at the same time. If a birth happens when r people coexist on a given island, the 
newborns will migrate each to a different virgin island, containing r resources. 

To make this model more natural, we will now consider continuous time trees, so that we 
can precisely tell which birth event forces certain individuals to migrate. Otherwise, imagine 
that at some generation, more than r individuals coexist. We would then have to choose which 
children of which individual migrate. Doing this in a relevant way would force us to choose 
the migrant individuals uniformly at random, what would introduce some randomness in the 
construction of the tree of isles from a given (deterministic) tree, what we want to avoid. 

5.1 Tree of isles 

Start from a finite continuous tree T, such that two events (birth or death) do not occur at the 
same time (which is a.s. the case for a continuous Galton- Watson tree). Once again, to define 
the tree of isles, we just need to define its migrant children. To this end, consider the process 
(Nt) counting the number of people alive at time t, which we assume to be cadlag. Define r be 
the first time (if any) such that 

• for t E [0,r), N t < r; 

• N T > r. 

Then the N T — r rightmost individuals born at r will be migrant children. After this, cut off 
the trees rooted at these individuals, and proceed identically with the pruned tree, until it is 
impossible to find other migrant children. See Figure 9 for an example. 

This thus allows us to construct the tree of isles as explained in Section 2.2. An example is 
given in Figure 10. 
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Figure 9: A tree 7 and its labeling, for r = 4. The individuals in a circle remain on the island, 
those in a square migrate. Here P r (7) = 12 and C r {7) = 3. 



We shall now explain how to label our tree so that the relevant information, namely its 
population P(7) and number of colonies C(T), can easily be read on the corresponding explo- 
ration process (S n ). According to Section 3.1, to define this labeling, we only need to precise 
the (Markovian) rule on how to choose the next vertex to be labeled. 

We shall take the following rule H, defined for every line £. For such a line, consider the 
pruned tree 1^, for which we can define migrant individuals thanks to the above algorithm. 
Take i = £ + U £° U f a partition of i in 

• the children of migrants individuals, 

• the migrant individuals 

• and the other individuals. 
Then 31 is defined as follows: 

• if £ + 7^ 0, then 5l(t) is the individual in £ + which will die the sooner; 

• if £ + = and £° ^ 0, then Jl(£) is the individual in £° which will die the sooner; 
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Figure 10: The tree of isles A(r) drawn from the tree 7 of Figure 9, for r = 4. 

• else 51(1) is the individual of i~ which will die the sooner. 

By construction, this is clearly a Markovian rule. 

This algorithm is just a modification of the death- first search algorithm. Informally, we 
apply the latter until we observe more than r people coexisting, say r' of them. When this 
is the case, r' — r newborns migrate, and for the sake of definiteness, we choose the r' — r 
rightmost ones. We then explore their subtrees one after another by death-first search, starting 
with the migrant individual who dies first. When all of these subtrees are explored, we resume 
the exploration of the initial tree by death-first search, until we see other migrant individuals 
or the exploration is over. All of this is probably clearer on a picture, see Figure 9. 

To this labeling, we may thus associate a walk (S n ). As shown on Figure 11, let us define 

o-i(r) = 0, = inf{n > 1, S n = -1} 

where as usual inf = +oo, and successively 

ft(r) = inf{n > a h S n > r - 1} A q^, er i+ i(r) = inf{n > ft, S n = r - 1} A fto, 

so that the walk first hits [r, +oo) at ft(r), then makes excursions 6 above r on each interval 
[ft(r), a i+ i(r)), and below r on each interval [a i+ i(r), ft +1 (r)), before hitting —1 at q^. To have 
convenient formulas, we have let all the o~i(r) and ft(r) to be equal to ft*, after the walk has hit 
— 1. Defines as well 

O i (r) = (5 ft(r) -(r-l))V0, i>l 
be the overshoot above level r — 1 (and by definition after —1 is hit), and 

ii(r) = ft(r) - (Ti(r), i > 1, 

so the sum of the £i(r)'s is the time spent below r — 1 before hitting —1. The relation between 
the population and number of colonies of the initial island and the corresponding exploration 
process is the following. 

Lemma 5.1. The equalities 

+oo +oo 

W) = £4(0, c P (oo = E °<(0 

1=1 2 = 1 

hold. 

6 We shall use this word in a very loose sense, only the precise definitions should be taken as rigorous. 
However, and thankfully, "Brownian excursion" will have its usual meaning. 
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Figure 11: The exploration process attached to the labeling of the tree of Figure 9, for r = 4 
(forgetting about the extra notation r). Check that 0i(r) + 0a(r) = 3 = C r (T) and £i(r) +^2( r ) + 
£ 3 {r) = 12 = P r (T). 



In words, P r (T) is the time spent by (SVJ under r — 1, and C r (T) the sum of the overshoots 
above level r — 1, both before hitting —1. The quantities involved are easy to follow on Figures 
9 and 11, and carefully following in parallel the walk and the tree on these examples should 
make the following proof quite obvious. 

Proof. As explained above, our labeling algorithm works as follows. As long as there are no 
migrant children, it is just the death-first search algorithm. As we mentioned in Section 3.2, 
1 + Si is precisely the number of individuals alive between the (i — l)-th and the 2-th event. 
Hence, the first time i that (1 + S n ) is greater than r is the first time when more than r people 
coexist. The supernumerary 1 + Si — r migrate, and this quantity is precisely the first overshoot 
Oi(r), whereas the i = £i(r) first individuals visited remain on the initial island. 

Then, we modify our algorithm to explore (by death-first search) the Oi(r) subtrees of the 
migrant children. According to the first part of Lemma 3.2, the first exploration goes from 
1 + Si to 1 + Si — 1 while remaining above 1 + S$ — 1, the second from 1 + Si — 1 to 1 + Si — 2 
and remains above 1 + Si — 2, . . . , and the (Di(r)-th and last from 1 + Si — Oj(r) + 1 = r to 
1 + Si — Oj(r) = r — 1 while remaining above r — 1. Hence, when all these explorations are 
done, no new overshoot has been observed, and no time below r has been cumulated. These 
subtrees have thus all been visited, and we may just as well cut them off. 

The exploration of the initial tree then resumes, and the same argument can then be applied 
to the pruned tree, so we can thus conclude by a simple induction on the size of the tree. □ 

From now on, we replace T by T, a Galton- Watson tree with reproduction law p and 
construct its tree of isles A(r) for r resources. We also replace (S n ) by an actual random walk 
defined on the whole of Z + , with step distribution p, so Lemma 5.1 can be rewritten as an 
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equality in law. Unlike the first model, an island with colonies can have an arbitrarily large 
population. Letting 7r r to be the law of the pair (P r (T), C r (T)), we cannot anymore write n r 
as (roughly) a product measure. It should be clear than an analogue to Lemmas 4.2 and 4.3 
holds. We call a N-type Galton- Watson tree (resp. process) a multitype Galton- Watson tree 
(resp. process) with types in N, and we still adopt the notation of Section 2.3. 

Lemma 5.2. The tree of isles A(r) is a N-type Galton-Watson tree, such that the population 
and number of colonies of each island has law Tr r . Similarly, the process (Z itP (r), p > l)j>o is 
a N-type Galton-Watson process, such that the type and number of children of each individual 
has law ~K r . 



5.2 Result 

As in Section 2.3, we now start from N independent islands and r^ resources, and employ the 
same notation. Let us first define, for a real function X and a real x, t x (X) to be the hitting 
time of (x, +oo) by X, i.e. 

r x (X) = inf{t > 0,X t > x}, inf0 = +oo. 

Let Cn = C rN (T) and Pn = P rjv (T). From Lemma 5.1, the probability that an island has 
colonies is 

F(C N > 0) = P(r rjv _ 1 (S) < ?00 ) ~ — . (10) 

r n 

The last equivalent is an extension of the classical gambler's ruin estimates, and follows e.g. 
from Theorem 2, p. 18, in [. '] and a Tauberian theorem (p. 203, 204 of [. ]). We shall thus 
assume that, for some c > 0, 

lim ^ = c > 0. (11) 

/V->.+oo N 

Beware that, as we mentioned, this is a different rescaling from (2) for the first model. We shall 
prove the following, where c := c/a and we recall that /i is a measure on (0, +oo) with density 
l/(2x 3 / 2 ). 

Theorem 5.3. Under the assumption (11), the sequence {"P^^n)) converges in distribution 
in M + to a random measure 7, characterized by a cumulant n(f), unique solution to 

exp -«(/) = A ^ + °° (l - e- f{x) ) fi(dx) + (l - e - f ^- K ^ c ) (12) 

for every f £ Ck, where (P, C) is a couple of random variables with Laplace transform 

( \/2nc \ 2 1 

E(exp -aP -PC)=[ r— 7= /=• ( 13 ) 

\sinhv2o;c/ f3c + v2accoth y2ac 

We shall follow the same route as for Theorem 4.5: write an equation for the cumulant of 
y {N \r N ), study its convergence, and prove that it has a unique solution. Obviously, we also 
need to prove the tightness, but it is obtained as for Lemma 4.6. 
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5.3 Interpretation 

As for the first model, let us try to give a reasonable interpretation of the limiting measure 7. 
First, let us study (P, C). To this end, notice that we may rewrite 

E(exp -aP - (3C) = / + °° -e~ x / c e~^ ( ) exp -x- ( - l) dx. 

Jo c \8inhy/2acJ c\th^2lxc J 

This makes clear the following facts. 

• C has an exponential 8,(1 /c) law (surprisingly, this quantity does not depend on the 
second moment of p). 

• Conditionally on C, P has the law of the sum of three independent variables: 

— two whose law has Laplace transform 

\/2ac 



sinh V2ac 

a third one whose conditional Laplace Transform is 




Note, and the reason for its appearance will be clear in the proofs, that y2ac/ sinh y2ac is 
the Laplace transform of the law of the hitting time of c by a Bessel 3 process. This distribution 
has a complicated density given by Theta functions, and we shall thus not dwell on this matter. 

Let us then construct a random measure rj as follows. We first build a tree T with fertilities, 
which is nothing else than a Galton- Watson tree with types in IR + . Each individual of fertility 
f has a number of descendants distributed as a Poisson distribution with parameter f . Each 
of these children chooses independently a fertility which is exponential 8.(1 /c). Conditionally 
on its fertility f, we attach to each individual a population P(f) distributed as a variable with 
Laplace transform 

E(exp-aP(f)) = [ . exp-f- 1 / v 2 ' W ^ 



k sinh \p2acj ' c \\h_\p2ac J 

We start from a (virtual) individual of fertility 1 and population, say, 1. We have then con- 
structed a tree T, to each vertex v of which a fertility f v and a population P v is attached. One 
readily checks that the reproduction law is critical and not 8i, so this tree is almost surely finite. 

Then, consider (r/ u (f),f > 0) u eu a family of independent variables, such that T) u (f) is a 
Poisson random measure with intensity fXfi. Define 

V = J2(Vu{Q +S Pk ) -8 X . 

Subtracting 8\ just means that we do not take into account the population of the virtual initial 
individual. The reader will check, as for the first model, that the cumulant of rj solves the same 
equation (12) as the cumulant of CP, so that these two measures have the same law. 
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5.4 Equation of the cumulant 

Let us first notice that, as for the first model, the cumulant of CP^(rjv), given by 

K N (f) = -lnE(exp-(T^ ) (r iV ),/)) = -AHiiE (exp -(7^ N \r N ), f)] 

for / G C#r, also solves Equation (3). The proof is identical. 
Lemma 5.4. The cumulant kjv(/) solves the following equation 



exp -**(/) = E (exp - (/ (^) + —M/)))" (14) 



/or every / G C^- 



We now assume that «jv(/) — ^ K (/)- Once again, the RHS of (14) has the same limit, if 
any, as 



iVE(l-ex P -(/(^) + <£ KN{ f) 



iVE((l-exp-/ f^lll^o} 



\N 2 , 

+ N¥(C N > 0)E (l - exp - (/ + ~K N (f)) \C N >0 

The first part is easy to deal with. Note indeed that by the Cauchy-Schwarz inequality, 

ME ((l - exp-/ (^)) 1 {Cjv=0} ) 2 < NF(C N = 0)E ((l - exp-/ (^))^ . 

By (10) and (11), the first term in the RHS is bounded. By Lemma 5.1, Pjy < W>, so P^/N 2 — > 
a.s. and the second term thus tends to by dominated convergence. Hence 

iVE ((l - exp -/ (^)) 1 {Cjv=0} ) ~ NE (l - exp -/ ^(l - e^Udx) 

as for Lemma 4.4. On the second hand, note that by (10) and (11) 

NP(Cn >0)E (l - exp - (/ (^) + ^(/)) 



i E (l-«p-(,$)-£M/) 



C N > 0] . 



We are thus led to compute the weak limit of the couple of random variables (Pn/N 2 , Cn/N) 
knowing that CV > 0. This is more involved than for our first model, and this will be the goal 
of the next section. 



5.5 Population and number of colonies of a fertile island 

We shall now give a formula for the population and number of colonies of a fertile island, by 
proving the following result. 
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Proposition 5.5. Conditionally on {Cn > 0} ; the variable (Pn/N 2 ,Cn/N) converges in law 
to a variable (P,C) whose Laplace transform is given by (13). 

Remark 5.6. In the proof, we shall actually assume that r^ = \_cN\ . The results hold without 
this restriction, but writing the proofs would then require some more pages of technical convo- 
lutions on which we do not wish to dwell. The essential ingredient to get rid of this restriction 
is to sandwich our processes, depending on r^/N, between two processes depending onc-£ 
and c + e, where e > can be chosen arbitrarily small, and to see that these two processes 
actually converge to the process depending on c when e — > 0. Depending on the cases, this is 
due to the absolute continuity of the laws considered or the continuity of the local time in the 
space variable. 

The proof of this result will be twofold. First, (P N /N 2 ,C N /N) can be reformulated, thanks 
to Lemma 5.1, in terms of functionals of a critical random walk with a second moment. At 
the limit, this quantity can thus be written as a functional of the Brownian motion. More 
precisely, we wish to condition on C^ > 0, which corresponds to conditioning this random 
walk to hit [rjv, +oo) before —1. The limit will thus actually be in terms of a functional of a 
Brownian excursion, conditioned on hitting (c, +oo). The second part of the proof is to compute 
the Laplace transform of this functional, which can be done through Williams' decomposition 
theorem of the excursion, along with a Ray-Knight theorem and some results of Pitman and 
Yor [J 9] concerning the Laplace transforms of functionals of Bessel bridges. 



5.5.1 Convergence to the excursion measure 

Let us first write P^ and Cjv in terms of a functional of the random walk (S n ), and to make 
this precise, we shall introduce some more notation. Let £ be the space of excursions, that is 
of nonnegative cadlag functions on IR + such that, for every e G £, 

C(e) := sup{t > 0, e t > 0} G [0, +oo). 

This space is endowed with the distance 

<5(e,e') = sup |e t - e' t \ + |C(e) - C(e')|, 

t>o 

which makes it a Polish space, see [16]. Let D° be the space of cadlag functions on R + vanishing 
at 0. For d > and / G D°, recall that r d (f) is the hitting time of (d, +oo), and let 

r d ,_(/) = inf{t < r d (f), f{t) = & Vs G [t, r A ) f{s) > 0} 

and finally 

T d , + (f) = mf{t>r d (f)J(t)<0}. 

These quantities are depicted in Figure 12. 

We let D° be the subset of D° such that r d and r d>+ are finite. For (X, Y) G B° d x D°, we 
define e d the function which extracts from X the first excursion which goes above level d, and 
shifts Y accordingly, in formulas: 

e d (X,Y) = [X {Td _ {X )+t)AT dr+ (X) V0, Y {Tdt _ {X )+t)Ar dy+ (X) ~ Y r d , + (x)) t>Q ■ 
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Figure 12: The quantities Td-(f), Td(/) an d T d,+(f) shown on the exploration process of Figure 
11. 



Clearly, e d (X, Y) e £ x D°. Let us finally define, for (X, Y) e £ x D°, 

rCPO 



( r<- 



^{x s <d} ds, Y ax) E 



which computes the total time spent by X below level d, as well as the value of Y at the final 
point of X. This is probably a good time to state the important feature of these mappings. We 
let here W be the law of the Brownian motion (which is in particular a law on D^). 

Lemma 5.7. • For W-a.e. X and every Y 6 D° ; the mapping ed ■ D|] x D° — > £ x Do is 

continuous at (X, Y) . 

• For any X e £ and continuous Y e D° ; the function <pd is continuous at (X, Y). 

Proof. The first part of the statement just stems from the fact that a.s., when hitting a value, 
the Brownian motion oscillates around it. Hence, the times Td~, r d and are continuous 
at W-almost every X, and the continuity of Cd then readily follows from the continuity of the 
Brownian motion. The second part is trivial by definition of the distance 5 and dominated 
convergence. □ 

From Lemma 5.1, it is now natural to define 

Lt = D Of (tn) 

i, qi<t 

be the sum of the overshoots above level rjv of (S n ) up to time t > 0. Now, let us define our 
rescaled random walk 

s[ N) = \ T s NHl t>0, 

o~N 
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and the rescaled overshoot process 




1 



aN 



t > 0. 



Recall that c = c/a. We may then reformulate Lemma 5.1, by stating that, whenever Cn > 0, 



We are now in a good position to state the main result of this section, which is quite similar 
to the main result of [16]. To this end, recall from Williams' description of Ito's measure (see 
e.g. Th. 4.5 p. 499 in [20]) that the Ito measure of the set of excursions with a maximum 
greater than c is finite. Hence, we may define n >c a probability law on £, which is the law of 
an excursion conditioned on having a maximum greater than c. This is in particular a semi- 
martingale, and we can thus define its local time. More generally, for a semi-martingale X, we 
let £f{X) is its local time at level a up to time t, and we obviously consider a modification of 
(£t(X), a£l,t>0) which is a.s. continuous in t and cadlag in a, see [20]. 

Lemma 5.8. The convergence in distribution 



holds, where e has law n >c . 

Proof. This is obviously obtained by passing to the limit in (15), though this requires some 
care. It should be intuitively clear that (S^ N \ L^) converges to a Brownian motion and its 
local time at c, which is precisely the content of Theorem 1.3 of [18], which we may reformulate, 
if we are careful of the different normalization from [20] , as 



weakly in D([0, +oo), 1R 2 ), where B is a standard Brownian motion. One should just take note 
of the two following facts: 

• our walk is left-continuous, so the definition of we give is precisely Formula (1.2) in 
[18], with x = c; 

• the only slight difference is that Theorem 1.3 of [ 8] deals with versions of the random 
walk and the local time which are linearly interpolated, unlike ours, but since the limit 
is continuous, and hence the limit holds for the topology of uniform convergence on the 
compacts, it clearly does not make any difference. 

Now, the latter, the continuity of ea and the continuous mapping theorem ensure that 



weakly in £ x D . But the definition of eg ensures that the first coordinate of ec(B,a£ c (B)/2) 
is the first excursion of a Brownian motion which goes above level c, and has thus law n >c . 
The second coordinate is a/2 times its total local time at c. The result then follows from the 




(15) 




(16) 





continuity of 



□ 
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5.5.2 Time spent under a level and local time of an excursion 

The last part of the proof of Proposition 5.5 is then to compute the Laplace transform of the 
RHS of (16). Hence, from now on, e is an excursion conditioned on having a maximum greater 
than c, that is a process with law n >c . We let 

(P,C)=U l {es <a } ds, 2 4(e) J . 

Lemma 5.9. The variable (P,C) has Laplace transform given by (13). 

Proof. Let us explain how to construct such an excursion e. From Williams' decomposition 
(see [20, Th 4.5 p. 499]), conditionally on its maximum M, which has a "law" with density 
l/(2x 2 )l{ x> oj, the Brownian excursion has the law of two independent Bessel 3 processes, until 
they hit M, put back-to-back. Hence, if R is a Bessel 3 process, then 

r+oo 1 / rr x (R) R a _ \ 2 

E(exp -aP - f3C) = 2c J_ ^E f exp -a J q t {Rs<d} ds - ^ )( . dx (17) 



and we are thus led to compute, for a fixed x > c, 



( f Tx{R) 
g(a, p) := E I exp -a J l{/?. fl <£} ds 



2 l MR) j ' 



Now, recall that the Bessel processes have the Brownian scaling property, i.e. R has the same 
law as Q = (xRt/ x 2) t > , so the occupation time formula provides 

and thus, still by Brownian scaling, 

rr x (Q) (3 a 



g{a,P) = E I exp -a ^ l {Q<c} ds - ^t Tx{Q) (Q) 



E ( exp -a I 1 {R s/x 2<c/x} ds ~ ^ x %5r)( r ) 



o 



= E |^exp -ax 2 t{ Rs< t/x} ds - -y-^^ fl) (-R)j • 

Hence, we are now led to compute, for u, v > and < r < 1, 

/ fri{R) \ 

h(u, v) := E I exp -u l{ Ru<r } du - vt Tl{R) (R) I . 

By the occupation time formula (note that the quadratic variation of a Bessel process is t), 

h{u,v) =E(ex P -« f o r t u Tl{R) (R) du - v£ r Ti(R) (R)^ 
which we may rewrite as 

E^exp-^^Md* 
34 



where u(dx) = ut[o )r ](dx) + v8 r . From a Ray-Knight Theorem', the process (£^(R), x G [0, 1]) 
is a squared Bessel 2 bridge, and thus, the latter quantity is precisely computed in Prop. 5.10 
in [19]. After easy but long computations, one obtains 



E exp — 



i 



t Tl{R) {R)v{dx)) 



V2u 



(1 — r) (2v sinh \/2ur + y/2u cosh \/2ur) + sinh y/2ur 



Hence 



g(a,P) 



x^/2a 




sinh y/2ac 1 + (x — c) (j3a + y/2a coth \/2ac) 



The result then readily follows from plugging this formula in (17). 



□ 



Remark 5.10. Recall the interpretation we gave of the variable (P,C) in Section 5.3. This 
formulas can thus be interpreted by saying that the excursion can be split into three parts: a 
part between and the hitting time of c, which has the law of a Bessel 3 process. Another 
part between the last hitting time of c and 0, with the same law. And in between, there is an 
exponential "quantity" of excursions above and below c; each contributes independently to a 
microscopic random amount, given by the above Laplace transform, to the time spent below c. 
See also the next section for a discrete version of these heuristics. 

Proposition 5.5 then follows from the result just proven and Lemma 5.8. To conclude the 
proof of Theorem 5.3, all we need to check now is that Equation (12) has a unique solution 
which is done as for Theorem 4.5. 

5.6 Another way to the result 

To conclude, let us present another way to compute the limit of (P^/N 2 , C^/N) knowing that 
Cn > 0. This method is more elementary but requires more steps, and might be seen as more 
natural - at least to the author, who firstly used it to derive the result. Once again, the goal 
is to compute the time spent under rjv, and the sum of the overshoots, both up to the hitting 
time of —1, for the random walk (S n ) conditioned on hitting [rjv, +oo) before —1. 

To obtain such an excursion of the random walk, we wait until we see (S n ) hit [rjv, +oo), 
and we consider the excursion from to —1 straddling this time. We can cut this excursion in 
three pieces: 

• a first piece, where the walk goes from to [r^r, +oo); 

• a third piece, where the walk goes from to — 1; 

• a second piece in between these two, where the walk goes from to without going 
back to 0, a certain amount of time. 

7 Surprisingly hard to find in the literature: it can be seen as a consequence of Theorem 4 in [23] along with 
the Williams' decomposition, Theorem 3.11 in [ ]. This statement is also given in [17], p. 42, along with a direct 
proof. 
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By the same reasonings as above, it should be clear that the first piece, after rescaling, converges 
to the first piece of a Brownian motion going from to c while remaining positive. From 
Williams' decomposition, Theorem 3.11 in [20], this has the law of a Bessel 3 process R, and 
thus the time it takes to hit c has the law of Tc(R), which has, as we mentioned, Laplace 
transform 

\PlOLC 



sinh \/2ac 

The third part obviously accounts for the same independent quantity. 

Now, let us study what happens in the middle. We may forget 8 about the time for the 
random walk to go from [rjy, +oo) back to — 1, and first consider the walk at — 1. From 
Formulas (3) p. 187 and (b) p. 181 in [21], the size of an overshoot has finite mean cx 2 /2, so 
(11) and a gambler's ruin estimate (see Lemma 5.1.3 in [ ]) show that the walk goes to — 1 
before coming back at with probability a 2 /(2cn). Hence, if we let Ln to be the number 
of such excursions from rjv to without going back to —1, we deduce therefrom that L^/N 
converges to a variable L with an exponential law £(cx 2 /(2c)). 

Now, conditionally on L, i.e., loosely, pa LN, the number of colonies Cn is the sum of 
Ln independent overshoots, and thus, by the law of large numbers, Cn/N converges to L times 
the mean size of an overshoot, i.e. C = a 2 L/2. 

Finally, still conditionally on L, we want to know the time spent below by this piece of 
the walk. At the limit, after rescaling and by similar reasonings as in the above section, it can 
be seen as the time spent below by a Brownian motion 

• up until it has accumulated a local time <jL at 0, 

• and conditioned on not hitting — c before this time (what has positive probability). 

We thus have to compute this quantity. But to construct a Brownian motion up to a local time 
of £, all we need is a Poisson point process (e s ) se [o/j with intensity n, the Ito measure, and then 
glue these excursions together (see [! 0]). Hence, by thinning, to construct such a conditioned 
Brownian motion, we take a Poisson point process (e a ) ae ro ^ with intensity n>_c, the restriction 
of n to the excursions with minimum greater than — c, and then glue these excursions together. 
Let R(e) be the length of an excursion e. Then the time spent by this conditioned Brownian 
motion under is 

^ -R(e s )l{inf e<0} 
0<s<^ 

and, by symmetry and using the exponential formula for Poisson point processes ([20], p. 476), 
for a > 0, 

Elexp-a ]T R{e s )l {iQie<0} ) = exp -£ [ {1 - e - aR(n) )l {inf u<0} n>_ c (d M ) 
\ o<s<e J J 

= exp —£ J(l — e~ aR ^)t {0<supu<£} n(d«) 

PC /"-foe 

= exp -£ / (1 - e~ ay ) m(dx, dy) 

Jx=0 Jy=0 



8 However, this relies on the walk having a second moment. Otherwise, it could have a macroscopic jump 
from below to above. 
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where m is the image of n (or n + , the Ito measure of the unsigned excursion) by e i— > 
(supe, R(e)). Once again, Williams' decomposition of the Brownian excursion tells that its 
maximum M has "law" 1/ (2x 2 )1j x> o\ dx, and that conditioned on this maximum, it has the 
law of two independent Bessel 3 processes R and R' put back to back. Therefore 




Putting the pieces together, this is just saying that, conditionally on L = 2C/a 2 , the variable 
P has Laplace transform 

/ V2ac \ 2 ^ 1 / V2ac \ _ / V^c \ 2 1 / V^c \ 

\sinh\/2ac/ 6XP ° 2c\thV2ac j ~ \ sinh V2ac ) &XP c\thV2ac ) 

which is precisely what we remarked after Theorem 5.3. The main advantage of this method 
is thus probably that it provides directly the law of P knowing C, which is not obvious when 
merely looking at Formula (13). 
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